A Pitch-Asynchronous Simple Method for Speech Synthesis by Diphone Concatenation using the Deterministic plus Stochastic Model
Authors
Abstract
One of the most common approaches to speech synthesis is the concatenation of diphones extracted from a previously recorded database. The prosodic parameters of the recorded speech fragments have to be adapted to the specifications of the new utterances to be synthesized. In this paper, the deterministic plus stochastic model of speech is used to modify and smoothly concatenate the analyzed diphones. Very high quality is reached without pitch synchronism, and complex calculations such as vocal tract estimation are avoided. Instead, simple linear interpolations and fast calculations are performed, and only harmonically related sinusoids are taken into account. The concatenated data are resynthesized by the overlap-add method.
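The two operations named in the abstract, linear interpolation and overlap-add resynthesis of harmonically related sinusoids, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `resample_harmonics` and `synthesize`, the triangular window with 50% overlap, the constant frame hop, and the phase-advance rule are all assumptions chosen for illustration, and the stochastic (noise) component of the model is omitted.

```python
import numpy as np

def resample_harmonics(amps, f0_old, f0_new):
    """Linearly interpolate harmonic amplitudes at the harmonics of a new pitch.

    Assumption: the amplitude envelope is read off by plain linear
    interpolation between the old harmonics (no vocal tract estimation).
    """
    amps = np.asarray(amps, dtype=float)
    old_freqs = f0_old * np.arange(1, len(amps) + 1)
    # Keep only new harmonics that fall inside the analysed band.
    num_new = max(1, int(len(amps) * f0_old / f0_new))
    new_freqs = f0_new * np.arange(1, num_new + 1)
    return np.interp(new_freqs, old_freqs, amps)

def synthesize(f0, amps, hop, fs):
    """Overlap-add frames of harmonic sinusoids at a constant frame rate.

    f0   : pitch per frame in Hz, shape (M,)
    amps : harmonic amplitudes per frame, shape (M, K)
    hop  : frame shift in samples (frames are 2*hop long, 50% overlap)
    """
    f0 = np.asarray(f0, dtype=float)
    amps = np.asarray(amps, dtype=float)
    num_frames, num_harm = amps.shape
    k = np.arange(1, num_harm + 1)
    t = (np.arange(2 * hop) - hop) / fs           # time relative to frame centre
    # Triangular window: shifted copies at 50% overlap sum to one.
    window = 1.0 - np.abs(np.arange(2 * hop) - hop) / hop
    # Assumed phase rule: the fundamental phase at each frame centre is the
    # accumulated pitch; harmonic k uses k times that phase.
    base_phase = 2 * np.pi * np.cumsum(f0) * hop / fs
    out = np.zeros(hop * (num_frames + 1))
    for m in range(num_frames):
        arg = 2 * np.pi * np.outer(k * f0[m], t) + (k * base_phase[m])[:, None]
        frame = amps[m] @ np.cos(arg)             # sum over harmonics
        out[m * hop : m * hop + 2 * hop] += window * frame
    return out
```

Because the frames sit on a fixed grid rather than on pitch marks, the sketch is pitch-asynchronous in the sense used above; with a constant pitch the overlapped frames reconstruct the underlying sinusoids exactly away from the signal edges.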
Similar resources
Efficient Speech Synthesis System using the Deterministic plus Stochastic Model
In this paper, a high-quality concatenative synthesis system using the deterministic plus stochastic model of speech is described, in which the prosodic modifications are performed by means of very simple and efficient operations, as we reported in a previous work [11]. In particular, pitch-synchrony is not necessary, and linear interpolations substitute other types of estimation. The method for...
Diphone concatenation using a harmonic plus noise model of speech
In this paper we present a high-quality text-to-speech system using diphones. The system is based on a Harmonic plus Noise (HNM) representation of the speech signal. HNM is a pitch-synchronous analysis-synthesis system but does not require pitch marks to be determined as necessary in PSOLA-based methods. HNM assumes the speech signal to be composed of a periodic part and a stochastic part. As a...
A biphone constrained concatenation method for diphone synthesis
Diphone concatenation [1] has the advantages of simplicity and a relatively small database of speech when compared to other concatenative synthesis methods (e.g., [2]). However, diphone concatenation faces two notable problems. The first is coarticulation which extends beyond the scope of a single diphone and entails some degree of contextual mismatch for virtually any diphone in at least some ...
Pitch Contours as Predictors of Audible Concatenation Artifacts
This paper deals with the traditional problem of the occurrence of audible discontinuities at concatenation points at diphone boundaries in the concatenative speech synthesis. While most of the related studies put stress on the spectral component, we focused on the pitch contours and their role as predictors of the discontinuities. To measure the amount of information contained in the pitch con...
Synthesis and Control of Synthesis Using a Generalized Diphone Method
Generalized Diphone Control is a powerful means of building a musical phrase from dictionaries of analysed sound units by building sequences of units and concatenating and articulating them. Through a graphical user interface on Macintosh, the Diphone 2.0 software provides analysis, control and synthesis according to various models, such as the Sinusoidal Additive model and the Chant model. A la...